adapter method
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning
Poth, Clifton, Sterz, Hannah, Paul, Indraneil, Purkayastha, Sukannya, Engländer, Leon, Imhof, Timo, Vulić, Ivan, Ruder, Sebastian, Gurevych, Iryna, Pfeiffer, Jonas
We introduce Adapters, an open-source library that unifies parameter-efficient and modular transfer learning in large language models. By integrating 10 diverse adapter methods into a unified interface, Adapters offers ease of use and flexible configuration. Our library allows researchers and practitioners to leverage adapter modularity through composition blocks, enabling the design of complex adapter setups. We demonstrate the library's efficacy by evaluating its performance against full fine-tuning on various NLP tasks. Adapters provides a powerful tool for addressing the challenges of conventional fine-tuning paradigms and promoting more efficient and modular transfer learning. The library is available via https://adapterhub.ml/adapters.
Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification
Razuvayevskaya, Olesya, Wu, Ben, Leite, Joao A., Heppell, Freddy, Srba, Ivan, Scarton, Carolina, Bontcheva, Kalina, Song, Xingyi
Adapters and Low-Rank Adaptation (LoRA) are parameter-efficient fine-tuning techniques designed to make the training of language models more efficient. Previous results demonstrated that these methods can even improve performance on some classification tasks. This paper complements the existing research by investigating how these techniques influence the classification performance and computation costs compared to full fine-tuning when applied to multilingual text classification tasks (genre, framing, and persuasion techniques detection; with different input lengths, number of predicted classes and classification difficulty), some of which have limited training data. In addition, we conduct in-depth analyses of their efficacy across different training scenarios (training on the original multilingual data; on the translations into English; and on a subset of English-only data) and different languages. Our findings provide valuable insights into the applicability of the parameter-efficient fine-tuning techniques, particularly to complex multilingual and multilabel classification tasks.